1 A NONPARAMETRIC MULTICLASS PARTITIONING METHOD FOR CLASSIFICATION by SAUL BRIAN GELFAND
نویسنده
چکیده
c classes are characterized by unknown probability distributions. A data sample containing labelled vectors from each of the c classes is available. The data sample is divided into test and training samples. A classifier is designed based on the training sample and evaluated with the test sample. The classifier is also evaluated based on its asymptotic properties as sample size increases. A multiclass recursive partitioning algorithm which generates a single binary decision tree for classifying all classes is given. The algorithm has the same desirable statistical and computational properties as Friedman's (1977) 2-class algorithm. Prior probabilities and losses are accounted for. A tree termination algorithm which terminates binary decision trees in a statistically optimal manner is given. GorT-don and Olshen's (1978) results on the asymptotic Bayes risk efficiency of 2-class recursive partitioning algorithms are extended to the c-class case and applied to the combined partitioning/termination algorithm. Asymptotic efficiency and consistent risk estimates are obtained with independent test and training sequences. 3 ACKNOWLEDGEMENT I would like to thank Professor Sanjoy Mitter for his intellectual and financial support throughout the course of this research. I would also like to acknowledge Don Gustafson whose ideas contributed heavily to the initial phase of the work.
منابع مشابه
2 a Nonparametric Multi Class Partitioning Method for Classification
c classes are characterized by unknown probability distributions. A data sample containing labelled vectors from each of the c classes is available. The data sample is divided into test and training samples. A classifier is designed based on the training sample and evaluated with the test sample. The classifier is also evaluated based on its asymptotic properties as sample size increases. A mul...
متن کاملمدل های پاسخ چند جمله ای، به منظور مدل بندی و تعیین عوامل موثر در انواع روش های استفاده شده برای جلوگیری از بارداری زنان
Difference aspects of multinomial statistical modelings and its classifications has been studied so far. In these type of problems Y is the qualitative random variable with T possible states which are considered as classifications. The goal is prediction of Y based on a random Vector X ? IR^m. Many methods for analyzing these problems were considered. One of the modern and general method of cla...
متن کاملMulti-borders classification
The number of possible methods of generalizing binary classification to multi-class classification increases exponentially with the number of class labels. Often, the best method of doing so will be highly problem dependent. Here we present classification software in which the partitioning of multiclass classification problems into binary classification problems is specified using a recursive c...
متن کاملتحلیل ممیز غیرپارامتریک بهبودیافته برای دستهبندی تصاویر ابرطیفی با نمونه آموزشی محدود
Feature extraction performs an important role in improving hyperspectral image classification. Compared with parametric methods, nonparametric feature extraction methods have better performance when classes have no normal distribution. Besides, these methods can extract more features than what parametric feature extraction methods do. Nonparametric feature extraction methods use nonparametric s...
متن کاملGrid Base Classifier in Comparison to Nonparametric Methods in Multiclass Classification
In this paper, a new method known as Grid Base Classifier was proposed. This method carries the advantages of the two previous methods in order to improve the classification tasks. The problem with the current lazy algorithms is that they learn quickly, but classify very slowly. On the other hand, the eager algorithms classify quickly, but they learn very slowly. The two algorithms were compare...
متن کامل